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Abstract 

The differential branching fraction of the rare decay )• An'^n~ is measured as a 
function of q^, the square of the dimuon invariant mass. The analysis is performed 
using proton-proton collision data, corresponding to an integrated luminosity of 
3.0 fb“^, collected by the LHCb experiment. Evidence of signal is observed in the 
region below the square of the J/^|J mass. Integrating over 15 < < 20 GeV^/c^ 

the branching fraction is measured as 

Aij.^fj.-)/dq‘^ = (1.18 0.03 ±0.27) x 10"^ {Ge\^/c^)-\ 

where the uncertainties are statistical, systematic and due to the normalisation 
mode, A^ —>■ J/ipA, respectively. In the q^ intervals where the signal is observed, 
angular distributions are studied and the forward-backward asymmetries in the 
dimuon (Apg) and hadron (Apg) systems are measured for the first time. In the 
range 15 < < 20 GeVyc"^ they are found to be 

App = —0.05 ± 0.09 (stat) ± 0.03 (syst) and 
App = —0.29 ±0.07 (stat) ±0.03 (syst). 
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1 Introduction 


The decay is a rare (&—)■ s) flavour-changing nentral current process that, 

in the Standard Model (SM), proceeds throngh electroweak loop (penguin and box) 
diagrams. As non-SM particles may also contribute to the decay amplitudes, measurements 
of this and similar decays can be nsed to search for physics beyond the SM. To date, 
emphasis has been placed on the study of rare decays of mesons rather than baryons, in 
part due to the theoretical complexity of the latter (^. In the particular system studied in 
this analysis, the decay products include only a single long-lived hadron, simplifying the 
theoretical modelling of hadronic physics in the final state. 

The study of baryon decays is of considerable interest for several reasons. Firstly, as 
the A^ baryon has non-zero spin, there is the potential to improve the limited understanding 
of the helicity structure of the underlying Hamiltonian, which cannot be extracted from 
meson decays . Secondly, as the A^ baryon may be considered as consisting of a heavy 
quark combined with a light diquark system, the hadronic physics differs significantly from 
that of the B meson decay. A farther motivation specific to the A° —)■ A/r■*■/!“ channel is 
that the polarisation of the A baryon is preserved in the A —)■ pvr” decaj0 giving access to 
complementary information to that available from meson decays [^. 

Theoretical aspects of the A° —)■ decay have been considered both in the SM 

and in some of its extensions |3]-[l6]. Althongh based on the same effective Hamiltonian 
as that for the corresponding mesonic transitions, the hadronic form factors for the A^ 
baryon case are less well-known dne to the less stringent experimental constraints. This 
leads to a large spread in the predicted branching fractions. The decay has a non-trivial 
angnlar strncture which, in the case of nnpolarised A° prodnction, is described by the 
helicity angles of the mnon and proton, the angle between the planes defined by the A 
decay prodncts and the two mnons, and the sqnare of the dimnon invariant mass, q^. 
In theoretical investigations, the differential branching fraction, and forward-backward 
asymmetries for both the dilepton and the hadron systems of the decay, have received 
particular attention [3|n T5-^. Different treatments of form factors are used depending 
on the region and can be tested by comparing predictions with data as a function of q^. 

In previous observations of the decay A° —)■ 1^ ,19 , evidence for signal had been 


limited to q^ values above the square of the mass of the 'ip{2S) resonance. This region will 
be referred to as “high-g^”, while that below the 'ijj{2S) will be referred to as “low-g^”. In 
this paper an updated measurement by LHCb of the differential branching fraction for the 
rare decay )■ Ap+/r“, and the first angular analysis of this decay mode, are reported. 
Non-overlapping intervals in the range 0.1-20.0 GeVyc"^, and theoretically motivated 
ranges 1.1-6.0 and 15.0-20.0GeV7c^ are used. The rates are normalised with 

respect to the tree-level 6—)- ccs decay A°—)■ Jj'^A, where J/^ —>■ This analysis uses 

pp collision data, corresponding to an integrated Inminosity of 3.0 fb~^, collected dnring 
2011 and 2012 at centre-of-mass energies of 7 and 8TeV, respectively. 


Mhe inclusion of charge-conjugate modes is implicit throughout. 
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2 Detector and simulation 


The LHCb detector [22p^ is a single-arm forward spectrometer covering the pseudorapidity 
range 2 < r/ < 5, designed for the study of particles containing 6 or c quarks. The detector 
includes a high-precision tracking system (VELO) consisting of a silicon-strip vertex 
detector surrounding the pp interaction region j^, a large-area silicon-strip detector 
located upstream of a dipole magnet with a bending power of about 4Tm, and three 
stations of silicon-strip detectors and straw drift tubes 25 placed downstream of the 
magnet. The tracking system provides a measurement of momentum, p, with a relative 
uncertainty that varies from 0.5% at low momentum to 1.0% at 200GeV/c. The minimum 
distance of a track to a primary vertex, the impact parameter, is measured with a 
resolution of (15 -|- 29 /pt) O-m, where px is fbe component of the momentum transverse to 
the beam, in GeV/c. Different types of charged hadrons are distinguished using information 
from two ring-imaging Cherenkov (RICH) detectors 26 . Photon, electron and hadron 


candidates are identihed using a calorimeter system that consists of scintillating-pad and 
preshower detectors, an electromagnetic calorimeter and a hadronic calorimeter. Muons 
are identified by a system composed of alternating layers of iron and multiwire proportional 
chambers 27 . 

consists of a hardware stage, based on information from the calorimeter 


The trigger 28 


and muon systems, followed by a software stage in which a full event reconstruction is 
carried out. Candidate events are first required to pass a hardware trigger, which selects 
muons with a transverse momentum px > 1.48 GeV/c in the 7TeV data or px > 1.76 GeV/c 
in the 8TeV data. In the subsequent software trigger, at least one of the final-state 
charged particles is required to have both px > 0.8GeV/c and impact parameter greater 
than 100 pm with respect to all of the primary pp interaction vertices (PVs) in the event. 
Finally, the tracks of two or more of the final-state particles are required to form a vertex 
that is significantly displaced from the PVs. 

Simulated samples of pp collisions are generated using Pythia 29 with a specific 
LHCb configuration 1^. Decays of hadronic particles are described by EvtGen 


The interaction of the 


in which final-state radiation is generated using Photos 
generated particles with the detector, and its response, are implemented using the Geant4 

The model used in the simulation of —)• ApApi~ 

(iG, together with Wilson 


toolkit 33 


34 


as described in Ref. 
decays includes and angular dependence as described in Ref. 


coefficients based on Refs. 354^. Interference effects from J/A and '0(25') contributions 


are not included. For the 7l°—)■ JjAA decay the simulation model is based on the angular 
distributions observed in Ref. 1^. 


3 Candidate selection 

Candidate A^ —)■ Ap^p~ (signal mode) and A^ —)■ J/0 A (normalisation mode) decays are 
reconstructed from a A baryon candidate and either a dimuon or a J/ip meson candidate, 
respectively. The Hj) —)■ J/ip A mode, with the J/0 meson reconstructed via its dimuon 
decay, is a convenient normalisation process because it has the same final-state particles 
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as the signal mode. Signal and normalisation channels are distinguished by the interval 
in which they fall. 

The dimuon candidates are formed from two well-reconstructed oppositely charged 
particles that are signihcantly displaced from any PV, identihed as muons and consistent 
with originating from a common vertex. 

Candidate A decays are reconstructed in the A —)■ p7i~ mode from two oppositely 
charged tracks that either both include information from the VELO {long candidates), 
or both do not include information from the VELO {downstream candidates). The A 
candidates must also have a vertex ht with a good a decay time of at least 2ps and an 
invariant mass within 30MeV/c^ of the known A mass [^. For long candidates, charged 
particles must have px > 0.25 GeV/c and a further requirement is imposed on the particle 
identification (PID) of the proton using a likelihood variable that combines information 
from the RICH detectors and the calorimeters. 

Candidate A^ decays are formed from A and dimuon candidates that have a combined 
invariant mass in the interval 5.3-7.0 GeV/c^ and form a good-quality vertex that is well- 
separated from any PV. Candidates pointing to the PV with which they are associated 
are selected by requiring that the angle between the A^ momentum vector and the vector 
between the PV and the A^ decay vertex, Ojy, is less than 14mrad. After the A^ candidate 
is built, a kinematic ht of the complete decay chain is performed in which the proton 
and pion are constrained such that the pvr” invariant mass corresponds to the known 
A baryon mass, and the A and dimuon systems are constrained to originate from their 
respective vertices. Furthermore, candidates falling in the 8 — 11 and 12.5 — 15 GeV^c^ 
intervals are excluded from the rare sample as they are dominated by decays via J/'0 and 
'ip{2S) resonances. 

The hnal selection is based on a neural network classiher 40,41 , exploiting 15 variables 
carrying kinematic, candidate quality and particle identihcation information. Both the 
track parameter resolutions and kinematic properties are different for downstream and 
long A decays and therefore a separate training is performed for each category. The 
signal sample used to train the neural network consists of simulated A^ —>■ Aqi~^fi~ events, 
while the background is taken from data in the upper sideband of the A^ candidate mass 
spectrum, between 6.0 and 7.0GeV/c^. Gandidates with a dimuon mass in either the J/'ip 
or ijj{2S) regions (±100 MeV/c^ intervals around their known masses) are excluded from 
the training samples. The variable that provides the greatest discrimination in the case of 
long candidates is the from the kinematic ht. For downstream candidates, the px of the 
A candidate is the most powerful variable. Other variables that contribute signihcantly 
are: the PID information for muons; the separation of the muons, the pion and the A^ 
candidate from the PV; the distance between the A and 71° decay vertices; and the pointing 
angle, 6d- 

The requirement on the response of the neural network classiher is chosen separately 
for low- and high-g^ candidates using two different hgures of merit. In the low-g^ region, 
where the signal has not been previously established, the hgure of merit e/{\/NB + a/2) 42 
is used, where £ and N-g are the signal efficiency and the expected number of background 
decays and a is the target signihcance; a value of a = 3 is used. In contrast, for the 
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high-g^ region the figure of merit Ns/\/Ns + is maximised, where is the expected 
number of signal candidates. To ensure an appropriate normalisation of N^, the number 
of — )■ J/'ip A candidates that satisfy the preselection is scaled by the measured ratio 
of branching fractions of —)■ to —>■ J/'0(—>■ ijAh~)A decays 19 , and the 

J/ip — )■ branching fraction j^. The value of N-q is determined by extrapolating 

the number of candidate decays found in the background training sample into the signal 
region. Relative to the preselected event sample, the neural network retains approximately 
96 % (66 %) of downstream candidates and 97 % (82 %) of long candidates for the selection 
at high (low) q^. 


4 Peaking backgrounds 

In addition to combinatorial background formed from the random combination of particles, 
backgrounds due to specific decays are studied using fully reconstructed samples of 
simulated b hadron decays in which the final state includes two muons. For the 71° —)■ J/xfj A 
channel, the only significant contribution is from i?°—)■ J/'ipK^ decays, with TFg —)■ 7r’''7r“ 
where one of the pions is misidentified as a proton. This decay contains a long-lived 77° 
meson and therefore has the same topology as the 7l° —>■ J/'ip A mode. This contribution 
leads to a broad shape that peaks below the 7l° mass region, which is taken into account 
in the mass fit. 

For the 7l° —)■ A/jA channel two sources of peaking background are identified. The 
first of these is 71° —)■ J/A A decays in which an energetic photon is radiated from either of 
the muons; this constitutes a background in the region just below the square of the J/A 
mass and in a mass region significantly below the 7l° mass. These events do not contribute 
significantly in the intervals chosen for the analysis. The second source of background 
is due to i?°—)■ RTg/r■*■/!“ decays, where 7F°—)■ 7r’''7r“ and one of the pions is misidentified 
as a proton. This contribution is estimated by scaling the number of 5° —)■ J/ip iF° events 
found in the 7l° —)■ J/A A fit by the ratio of the world average branching fractions for the 
decay processes i?°—)■ iF°/i+/i“ and i?°—)■ J/0 (—)■ /i+/i“)iF° j^. Integrated over this 
is estimated to yield fewer than ten events, which is small relative to the expected total 
background level. 


5 Yields 

5.1 Fit procedure 

The yields of signal and background events in the data are determined in the mass range 
5.35-6.00GeV/c^ using unbinned extended maximum likelihood fits for the 71°—)- Tl/i+zi” 
and the 71° —)■ J/A A modes. The likelihood function has the form 

N 

c = X n[^sl"s(m,) -h iVcPc(m,) + iVpPp(m,)] , (1) 
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where Ns, Nq and Np are the number of signal, combinatorial and peaking background 
events, respectively, Pj{mi) are the corresponding probability density functions (PDFs) 
and mi is the mass of the candidate. The signal yield itself is parametrised in the fit 
using the relative branching fraction of the signal and normalisation modes. 


Ns{,Aii^^ )k 


W/M) 


■Ns{mA)u-et- 


Aq^ 


( 2 ) 


where k is the candidate category (long or downstream), Aq^ is the width of the q^ interval 
considered and is the relative efficiency, hxed to the values obtained as described in 
Sec. Fitting the ratio of the branching fractions of signal and normalisation modes 
simultaneously in both candidate categories makes better statistical use of the data. 

The signal shape, in both —)■ and A^ —>■ J/'ijjA modes, is described by the sum 

of two Crystal Ball functions that share common means and tail parameters but have 
independent widths. The combinatorial background is parametrised by an exponential 
function, independently in each q‘^ interval. The background due to —)■ J/ijj Kg decays 
is modelled by the sum of two Crystal Ball functions with opposite tails. All shape 
parameters are independent for the downstream and long sample. 

For the Aj)—)• J/ipA mode, the widths and common mean in the signal parametrisation 
are free parameters. The parameters describing the shape of the peaking background are 
hxed to those derived from simulated 5°—)- J/'ipKg decays, with only the normalisation 
allowed to vary to accomodate differences between data and simulation. 

For the A^ —)■ decay, the signal shape parameters are hxed according to the result 

of the ht to A° — A data and the widths are rescaled to allow for possible diherences 
in resolution as a function of q^. The scaling factor is determined comparing )■ J/'ijjA 
and A° —)■ Afi~^fi~ simulated events. The —)■ ATg/i+zi” background component is also 
modelled using the sum of two Crystal Ball functions with opposite tails where both the 
yield and all shape parameters are constrained to those obtained from simulated events. 


5.2 Fit results 

The invariant mass distribution of the A° — )■ J/'ip A candidates selected with the high-g^ 
requirements is shown in Fig. combining both long and downstream candidates. The 
normalisation cha nn el candidates are divided into four sub-samples: downstream and 
long events are htted separately and each sample is selected with both the low-g^ and 
high-g^ requirements to normalise the corresponding g^ regions in signal. The number of 
A° —)■ J/jp A decays found in each case is given in Table 

The fraction of peaking background events is larger in the downstream sample amount¬ 
ing to 28 % of the A° —)■ J/'ip A yield in the full htted mass range, while in the sample of 
long candidates it constitutes about 4 %. 

The invariant mass distributions for the Aj)—)■ A/r■*■/!“ process, integrated over 15.0 < 
g^ < 20.0GeV^/c^ and in eight separate g^ intervals, are shown in Figs. and The 
yields found in each g^ interval are given in Table together with their signihcances. The 
statistical signihcance of the observed signal yields is evaluated as \/2A\njC, where A In £ 
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Table 1: Number of J/ij^A decays in the long and downstream categories found using the 
selection for low- and high-g'^ regions. Uncertainties shown are statistical only. 


Selection iVg (long) iVg (downstream) 
high-g^ 4313 ± 70 11497 ± 123 

low-g2 3363 ± 59 7225 ± 89 



Figure 1: Invariant mass distribution of the —)• J/'A A candidates selected with the neural 
network requirement used for the high-g^ region. The (black) points show data, combining 
downstream and long candidates, and the solid (blue) line represents the overall fit function. 
The dotted (red) line represents the combinatorial and the dash-dotted (brown) line the peaking 
background from ^ J/'ipKg decays. 


is the change in the logarithm of the likelihood fnnction when the signal component is 
excluded from the £t, relative to the nominal fit in which it is present. 

6 Relative efficiency 

The measurement of the differential branching fraction of A^ —)■ relative to A^ —)■ 

J/xIjA benehts from the cancellation of several potential sources of systematic uncertainty 
in the ratio of efficiencies, = £tot(7l° —)■ 7l/r'''/i“)/etot(7l° —)■ A). Due to the long 

lifetime of A baryons, most of the candidates are reconstructed in the downstream category, 
with an overall efficiency of 0.20 %, while the typical efficiency is 0.05 % for long candidates. 
The efficiency of the PID is obtained from a data-driven method and found to 
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M(A|X|i) [MeV/c^] 

Figure 2: Invariant mass distribution of the )• candidates, integrated over the region 

15.0 < < 20.0 GeV^/c^ together with the fit function described in the text. The points show 

data, the solid (blue) line is the overall fit function and the dotted (red) line represents the 
combinatorial background. The background component from )• decays, (brown) 

dashed line, is barely visibile due to the very low yield. 


Table 2: Signal decay yields (Nq) obtained from the mass fit to —)■ candidates in 

each interval together with their statistical significances. The yields are the sum of long and 
downstream categories with downstream decays comprising ~ 80 % of the total yield. The 8 — 11 
and 12.5 — 15 GeV^/c^ q^ intervals are excluded from the study as they are dominated by decays 
via charmonium resonances. 


interval [ GeV^] 

Total signal yield 

Significance 

0.1 - 2.0 

16.0 ±5.3 

4.4 

2.0 - 4.0 

4.8 ±4.7 

1.2 

4.0 - 6.0 

0.9 ±2.3 

0.5 

6.0 - 8.0 

11.4 ±5.3 

2.7 

11.0 - 12.5 

60 ± 12 

6.5 

15.0 - 16.0 

57 ±9 

8.7 

16.0 - 18.0 

118 ± 13 

13 

18.0 - 20.0 

100 ± 11 

14 

1.1 - 6.0 

9.4 ±6.3 

1.7 

15.0 - 20.0 

276 ± 20 

21 
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M(Ann) [MeV/c^] MCAp-n) [MeV/c^] 

Figure 3: Invariant mass distributions of candidates, in eight intervals, together 

with the fit function described in the text. The points show data, the solid (blue) line is the 
overall fit function and the dotted (red) line represents the combinatorial background component. 
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be 98 % while all other efficiencies are evaluated using simulated data. The models used 
for the simulation are summarised in Sec. The trigger efficiency is calculated using 
simulated data and increases from approximately 56 % to 86 % between the lowest and 
highest regions. An independent cross-check of the trigger efficiency is performed using a 
data-driven method. This exploits the possibility of categorising a candidate —)• A/i+zi” 
or A° —)• J/ijj A decay in two ways depending on which tracks are directly responsible for its 
selection by the trigger; “trigger on signal” candidates, where the tracks responsible for 
the hardware and software trigger decisions are associated with the signal; and “trigger 
independent of signal” candidates, with a baryon reconstructed in either of these 
channels but where the trigger decision does not depend on any of their decay products. 
As these two categories of event are not mutually exclusive, their overlap may be used to 
estimate the efficiency of the trigger selection using data. Using A° —>■ J/'ijjA candidates and 
calculating the ratio of yields that are classified as both trigger on signal and independent 
of signal, relative to those that are classffied as trigger independent of signal, an efficiency 
of (70 ± 5) % is obtained, which is consistent with that of (73.33 ± 0.02) % computed from 
simulation. 

The relative efficiency for the ratio of branching fractions in each interval, calculated 
from the absolute efficiencies described above, is shown in Fig. The increase in efficiency 
as a function of increasing g^ is dominated by two effects. Firstly, at low g^ the muons have 
lower momenta and therefore have a lower probability of satisfying the trigger requirements. 
Secondly, at low g^ the A baryon has a larger fraction of the A° momentum and is more 
likely to decay outside of the acceptance of the detector. Separate selections are used for 
the low- and high-g^ regions and, as can be seen in Fig. the tighter neural network 
requirement used in the low-g^ region has a stronger effect on downstream candidates. 

The uncertainties combine both statistical and systematic contributions (with the 
latter dominating) and include a small correlated uncertainty due to the use of a single 
simulated sample of >■ J/z/^A decays as the normalisation channel for all g^ intervals. 
Systematic uncertainties associated with the efficiency calculation are described in detail 
in Sec. [7l 

7 Systematic uncertainties on the branching fraction 

7.1 Yields 

Three sources of systematic uncertainty on the measured yields are considered for both the 
A°—)■ J/ipA and the A°—)■ Ap+p“ decay modes: the shape of the signal PDF, the shape of 
the background PDF and the choice of the hxed parameters used in the hts to data. 

For both decays, the default signal PDF is replaced by the sum of two Gaussian functions. 
All parameters of the Gaussian functions are allowed to vary to take into account the effect 
of hxing parameters. The shape of the background function is changed by permitting the 
peaking background yield, which is hxed to the value obtained from simulation 
the nominal ht, to vary. For the resonant channel, the peaking background shape 

is changed by hxing the global shift to zero. Finally, simulated experiments are performed 
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Figure 4: Total relative efficiency, Ereh between —)• and A^ —)• Ajil) A decays. The 

uncertainties are the combination of both statistical and systematic components, and are 
dominated by the latter. 


using the default model, separately for each interval, generating the same number of 
events as observed in data. Each distribution is htted with the default model and the 
modihed PDFs. The average deviation over the ensemble of simulated experiments is 
assigned as the systematic uncertainty. The relative change in signal yield due to the 
choice of signal PDF varies between 0.6 % and 4.6 % depending on while the change due 
to the choice of background PDF is in the range between 1.1 % and 2.5 %. The intervals 
that are most affected are those in which a smaller number of candidates is observed and 
therefore there are fewer constraints to restrict potentially different PDFs. The systematic 
uncertainties on the yield in each q^ interval are summarised in Table where the total is 
the sum in quadrature of the individual components. 

7.2 Relative efficiencies 

The dominant systematic effect is that related to the current knowledge of the angular 
structure and the dependence of the decay channels. The uncertainty due to the finite 
size of simulated samples is comparable to that from other sources. The total systematic 
uncertainties on the efficiencies, calculated as the sums in quadrature of the individual 
components described below, are summarised in Table 
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7.2.1 Decay structure and production polarisation 


The main factors that affect the detection efficiencies are the angnlar strnctnre of the 
decays and the production polarisation (P^). Although these arise from different parts of 
the process, the efficiencies are linked and are therefore treated together. 

For the —)■ decay, the impact of the limited knowledge of the production po¬ 

larisation, Pfo, is estimated by comparing the default efficiency, obtained in the unpolarised 
scenario, with those in which the polarisation is varied within its measured uncertainties, 
using the most recent LHCb measurement, Pb = 0.06 ± 0.09 (^. The larger of these 
differences is assigned as the systematic uncertainty from this source. This yields a ~ 0.5 % 
uncertainty on the efficiency of downstream candidates and ~ 1.2 % for long candidates. 
No significant dependence is found. 

To assess the systematic uncertainty due to the limited knowledge of the decay structure, 
the efficiency corresponding to the default model 16,35,44 is compared to that of a 


model containing an alternative set of form factors based on a lattice QCD calculation 15 


The larger of the full difference or the statistical precision is assigned as the systematic 
uncertainty. 

For the )■ J/'ipA mode, the default angular distribution is based on that observed 
in Ref. 37 . The angular distribution is determined by the production polarisation and 
four complex decay amplitudes. The central values from Ref. are used for the nominal 
result. To assess the sensitivity of the A^ —)■ J/'ipA mode to the choice of decay model, the 
production polarisation and decay amplitudes are varied within their uncertainties, taking 
into account correlations. 

To assess the potential impact that physics beyond the SM might have on the detection 
efficiency, the Cr and Cg Wilson coefficients are modified by adding a non-SM contribution 
{Ci —>■ Cj -t- Cj). The Cj added are inspired to maintain compatibility with the recent 
LHCb result for the Pg observable 45 and indicate a change at the level of ~ 7 % in the 
0.1-2.0 interval, and 2-3% in other regions. No systematic is assigned as a result of 
this study. 


7.2.2 Reconstruction efficiency for the A baryon 

The A baryon is reconstructed from either long or downstream tracks, and their relative 
proportions differ in data and simulation. This proportion does not depend significantly 
on and therefore possible effects cancel in the ratio with the normalisation channel. 
Furthermore, since the analysis is performed separately for long and downstream candidates, 
it is not necessary to assign a systematic uncertainty to account for a potential effect due to 
the different fractions of candidates of the two categories observed in data and simulation. 
To allow for residual differences between data and simulation that do not cancel completely 
in the ratio between signal and normalisation modes, systematic uncertainties of 0.8% 
and 1.2 % are estimated for the low-g^ and high-g^ regions, respectively, using the same 
data-driven method as in Ref. 


46 
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Table 3: Systematic uncertainties as a function of assigned for yields and efficiencies. Values 
reported are the sums in quadrature of all contributions evaluated within each category. 


g^ interval | 

;GeVVc^ ] 

Syst. on yields [%] 

Syst. on eff. [%] 

0.1 - 

2.0 

3.4 

+ 2.2 

- 3.6 

2.0 - 

4.0 

3.8 

+ 2.2 

- 4.1 

4.0 - 

6.0 

6.6 

+ 17.2 

- 14.3 

6.0 - 

8.0 

2.0 

+ 2.1 

- 3.1 

11.0 - 

12.5 

3.2 

+ 3.7 

- 5.2 

15.0 - 

16.0 

2.8 

+ 3.1 

- 2.8 

16.0 - 

18.0 

1.4 

+ 3.0 

- 4.1 

18.0 - 

20.0 

2.5 

+ 3.9 

- 2.3 

1.1 - 

6.0 

4.2 

+ 2.2 

- 4.6 

15.0 - 

20.0 

1.0 

+ 2.0 

- 2.9 


7.2.3 Production kinematics and lifetime of the baryon 

In —)■ J/'ijj A decays a small difference is observed between data and simulation in the 
momentum and transverse momentum distributions of the A^ baryon produced. Simulated 
data are reweighted to reproduce these distributions in data and the relative efficiencies 
are compared to those obtained using events that are not reweighted. This effect is less 
than 0.1 %, which is negligible with respect to other sources. 

Finally, the A^ baryon lifetime used throughout corresponds to the most recent LHCb 
measurement, 1.479 ± 0.019 ps 47 . The associated systematic uncertainty is estimated by 


varying the lifetime value by one standard deviation and negligible differences are found. 


8 Differential branching fraction 

The values for the absolute branching fraction of the A^ —)■ A^^ii~ decay, obtained by 
multiplying the relative branching fraction by the absolute branching fraction of the 
normalisation channel, )■ J/i^A) = (6.3 ± 1.3) x 10“^ 38 , are given in Fig. [^and 

summarised in Table where the SM predictions are obtained from Ref. 15 . The relative 
branching fractions are given in the Appendix. 

Evidence for signal is found in the region between the charmonium resonances and 
in the interval 0.1 < < 2.0 GeVYc"^, where an increased yield is expected due to the 

proximity of the photon pole. The uncertainty on the branching fraction is dominated by 
the precision of the branching fraction for the normalisation channel, while the uncertainty 
on the relative branching fraction is dominated by the size of the data sample available. 
The data are consistent with the theoretical predictions in the high-g^ region but lie below 
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the predictions in the low-g^ region. 


Table 4: Measured differential branching fraction of —)• where the uncertainties 

are statistical, systematic and due to the uncertainty on the normalisation mode, t JjipA, 
respectively. 


interval [GeVy—>■ )/dg^ ■ 10 ’^[(GeVyc'^) 


0.1 

-2.0 

0.36 

+ 0.12 
- 0.11 

+ 0.02 
- 0.02 

±0.07 

2.0 

-4.0 

0.11 

+ 0.12 
- 0.09 

+ 0.01 
- 0.01 

±0.02 

4.0 

- 6.0 

0.02 

+ 0.09 
- 0.00 

+ 0.01 
- 0.01 

±0.01 

6.0 

- 8.0 

0.25 

+ 0.12 
- 0.11 

+ 0.01 
- 0.01 

±0.05 

11.0 

- 12.5 

0.75 

+ 0.15 
- 0.14 

+ 0.03 
- 0.05 

±0.15 

15.0 

- 16.0 

1.12 

+ 0.19 
- 0.18 

+ 0.05 
- 0.05 

±0.23 

16.0 

- 18.0 

1.22 

+ 0.14 
- 0.14 

+ 0.03 
- 0.06 

±0.25 

18.0 

- 20.0 

1.24 

+ 0.14 
- 0.14 

+ 0.06 
- 0.05 

±0.26 


1.1 - 6.0 0.09 ±0.02 

15.0 - 20.0 1.20 tom -aw ±0-25 


^ 1.8 

1-6 

> 

r -- 

O 1.2 

(N 

^ 1 
i: 0.8 
< 0.6 
^ 0.4 

Xi 

< 

^ 0.2 


0 5 10 15 20 

[GeVVc4] 

Figure 5: Measured 4^—)• AfjA branching fraction as a function of with the predictions of 
the SM [15] superimposed. The inner error bars on data points represent the total uncertainty 
on the relative branching fraction (statistical and systematic); the outer error bar also includes 
the uncertainties from the branching fraction of the normalisation mode. 


SM prediction 
Data 



LHCb ^ 


■ ■ ■ I I I I I I I I I I I I I I I I I 
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9 Angular analysis 


The forward-backward asymmetries of both the dimuon system, Apu) of P'^ system, 
Apg, are defined as 




Jo 


d^r 

dq^ dcos 6i 


dcos 9i — 


dcos 9i 


dcos 6i 


dr/dg2 


(3) 


where d^T/ dq'^ dcos 0* is the two-dimensional differential rate and dT/dg^ is the rate 
integrated over the corresponding angles. The observables are determined by a fit to 
one-dimensional angular distributions as a function of cosOi, the angle between the positive 
(negative) muon direction and the dimuon system direction in the (yl°) rest frame, 
and cosOh, which is defined as the angle between the proton and the A baryon directions, 
also in the rest frame. The differential rate as a fnnction of cos 9^ is described by the 
function 


d2r(Ab ^ A£+r 

dg^ dcos 


dT 

dg2 


I (l cos^ 9t) (1 - /l) cos 9e + y /l sin^ 9e 

o 4 


(4) 


where /p is the fraction of longitudinally polarised dimuons. The rate as a function of 
cos 0/1 has the form 


d^T{Ab A(^ p7r-)i+i-) 

dg2 dcos 9h 


= B{A —)■ pn 


_,dr(Afe^ A£+r)i 


dg2 


2(1 + 2AbB 


cos9t 


(5) 


These expressions assume that A° baryons are produced unpolarised, which is in agreement 
with the measnred prodnction polarisation at LHCb 


37 


The forward-backward asymmetries are measnred in data using unbinned maximum 
likelihood fits. The signal PDF consists of a theoretical shape, given by Eqs. and 
multiplied by an acceptance function. Selection requirements on the minimum momentum 
of the muons may distort the cos 9^ distribution by removing candidates with extreme values 
of cos 9£. Similarly, the impact parameter requirements affect cos 9h as very forward hadrons 
tend to have smaller impact parameter values. The angular efficiency is parametrised using 
a second-order polynomial and determined separately for downstream and long candidates 
by fitting simulated events, with an independent set of parameters obtained for each g^ 
interval. These parameters are fixed in the fits to data. The acceptances are shown in 
Fig. as a function of cos9h and cos9£ in the 15 < g^ < 20 GeVYc^ interval for each 
candidate category. 

The background shape is parametrised by the product of a linear function and the 
signal efficiency, with the value of the slope determined by fitting candidates in the upper 
mass sideband, m[ApApr) > 5700 MeV/c^. To limit systematic effects due to uncertainties 
in the backgronnd parametrisation, an invariant mass range that is dominated by signal 
events is used: 5580 < m{Ap^p~) < 5660 MeV/c^. The ratio of signal to background 
events in this region is obtained by performing a fit to the invariant mass distribution in a 
wider mass interval. 
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Figure 6: Angular efficiencies as a function of (left) cosOi and (right) cosOh for (upper) long and 
(lower) downstream candidates, in the interval 15 <q^ < 20 GeV^/c'^, obtained using simulated 
events. The (blue) line shows the fit that is used to model the angular acceptance in the fit to 
data. 


The angular fit is performed simultaneously for the samples of downstream and long 
candidates, using separate acceptance and background functions for the two categories 



Figure 7: Angular distributions as a function of (left) cosOg, and (right) cosOh, for candidates in 
the integrated 15 < < 20 GeV^/c^ interval with the overall fit function overlaid (solid blue). 

The (red) dotted line represents the combinatorial background. 
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while keeping the angular observables as shared parameters. Angular distributions are 
shown in Fig. where the two candidate categories are combined. 


10 Systematic uncertainties on angular observables 

10.1 Angular correlations 

To derive Eqs. and a uniform efficiency is assumed. However, non-uniformity is 
observed, especially as a function of cosOh (see Fig. [^. Therefore, while integrating over 
the full angular distribution, terms that would cancel with constant efficiency may remain 
and generate a bias in the measurement of these observables. To assess the impact of this 
potential bias, simulated experiments are generated in a two-dimensional (cos 6i,cos 9h) 
space according to the theoretical distribution multiplied by a two-dimensional efficiency 
histogram. Projections are then made and are htted with the default one-dimensional 
efficiency functions. The average deviations from the generated parameters are assigned as 
systematic uncertainties. The magnitudes of these are found to be —0.032 for 0.013 
for Apg and 0.028 for /l, independently of q^. In most intervals this is the dominant 
source of systematic uncertainty. 


10.2 Resolution 

Resolution effects may induce an asymmetric migration of events between bins and therefore 
generate a bias in the measured value of the forward-backward asymmetries. To study 
this systematic effect, a map of the angular resolution function is created using simulated 
events by comparing reconstructed quantities with those in the absence of resolution 
effects. Simulated experiments are then generated according to the measured angular 
distributions and smeared using the angular resolution maps. The simulated events, before 
and after smearing by the angular resolution function, are htted with the default PDF. 
The average deviations from the default values are assigned as systematic uncertainties. 
These are larger for the Apg observable because the resolution is poorer for cos 6h and 
the distribution is more asymmetric, yielding a net migration effect. The uncertainties 
from this source are in the ranges [0.011,0.016] for Apg, [-0.001,-0.007] for App and 
[0.002,0.008] for /l, depending on q^. 


10.3 Angular acceptance 

An imprecise determination of the efficiency due to data-simulation discrepancies could 
bias the Apn measurement. To estimate the potential impact arising from this source, the 
kinematic reweighting described in Sec. 7.2 is removed from the simulation. Simulated 
samples are htted using the same theoretical PDF multiplied by the efficiency function 
obtained with and without kinematical reweighting. The average biases evaluated from 
simulated experiments are assigned as systematic uncertainties. These are larger for sparsely 
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Table 5: Measured values of leptonic and hadronic angular observables, where the first uncertain¬ 
ties are statistical and the second systematic. 


q^ interval ] 


^PB 





/l 




Ah 

^FB 



0.1 

- 2.0 

0.37 

- 1 - 0.37 
- 0.48 

± 

0.03 

0.56 

+ 

0.23 

0.56 

± 

0.08 

- 0.12 

- 1 - 0.31 
- 0.28 

± 

0.15 

11.0 

- 12.5 

0.01 

- 1 - 0.19 
- 0.18 

± 

0.06 

0.40 

+ 

0.37 

0.36 

± 

0.06 

- 0.50 

- 1 - 0.10 
- 0.00 

± 

0.04 

15.0 

- 16.0 

-0.10 

- 1 - 0.18 
- 0.16 

± 

0.03 

0.49 

+ 

0.30 

0.30 

± 

0.05 

- 0.19 

- 1 - 0.14 
- 0.16 

± 

0.03 

16.0 

- 18.0 

-0.07 

- 1 - 0.13 
- 0.12 

± 

0.04 

0.68 

+ 

0.15 

0.21 

± 

0.05 

- 0.44 

- 1 - 0.10 
- 0.05 

± 

0.03 

18.0 

- 20.0 

0.01 

- 1 - 0.15 
- 0.14 

± 

0.04 

0.62 

+ 

0.24 

0.27 

± 

0.04 

- 0.13 

- 1 - 0.09 
- 0.12 

± 

0.03 

15.0 

- 20.0 

-0.05 

- 1 - 0.09 
- 0.09 

± 

0.03 

0.61 

+ 

0.11 

0.14 

± 

0.03 

- 0.29 

- 1 - 0.07 
- 0.07 

± 

0.03 


populated intervals and vary in the intervals [0.009,0.016] for Apg, [0.001,0.007] for 
Apg and [0.002, 0.044] for /l, depending on q^. 

The effect of the limited knowledge of the polarisation is investigated by varying 
the polarisation within its measnred nncertainties, in the same way as for the branching 
fraction measurement. No signihcant effect is found and therefore no contribution is 
assigned. 

10.4 Background parametrisation 

As there is ambiguity in the choice of parametrisation for the background model, in 
particnlar for regions with low statistical significance in data, simnlated experiments are 
generated from a PDF corresponding to the best ht to data, for each interval. Each 
simulated sample is htted with two models; the nominal ht model, consisting of the 
product of a linear function and the signal efficiency, and an alternative model formed from 
a constant function mnltiplied by the efficiency shape. The average deviations are taken 
as systematic nncertainties. These are in the ranges [0.003,0.045] for App, [0.017,0.053] 
for App and [0.014,0.049] for /l, depending on q^. 


11 Results of the angular analysis 


The angular analysis is performed using the same intervals as those used in the branching 
fraction measurement. Results are reported for each q^ interval in which the statistical 
signihcance of the signal is at least three standard deviations. This inclndes all of the q^ 
intervals above the J/ijj resonance and the lowest q^ bin. 

The measured values of the leptonic and hadronic forward-backward asymmetries, App 
and App, and the /l observable are snmmarised in Table with the asymmetries shown 
in Fig. The statistical uncertainties are obtained using the likelihood-ratio ordering 
method where only one of the two observables at a time is treated as the parameter 
of interest. In this analysis nuisance parameters were accounted for using the plug-in 
method 49 . In Fig. |^the statistical uncertainties on App and /l are also reported (for the 
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Figure 8: Measured values of (left) the leptonic and (right) the hadronic forward-backward 
asymmetries in bins of (f'. Data points are only shown for (f‘ intervals where a statistically 
significant signal yield is found, see text for details. The (red) triangle represents the values for 
the 15 < < 20 GeV^/c^ interval. Standard Model predictions are obtained from Ref. 
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Figure 9: Two-dimensional 68% CL region (black) as a function of and /l. The shaded 
area represents the region where the PDF is positive over the complete cos 6i range. The best fit 
point is given by the (blue) star. 


interval 15 < < 20 GeVyc'^) as a two-dimensional 68 % confidence level (CL) region, 

where the likelihood-ratio ordering method is applied by varying both observables and 
therefore taking correlations into acconnt. Conhdence regions for the other intervals 
are shown in Fig. 10, see Appendix. 
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12 Conclusions 


A measurement of the differential branching fraction of the —)■ decay is performed 

using data, corresponding to an integrated luminosity of 3.0 fb~^, recorded by the LHCb 
detector at centre-of-mass energies of 7 and 8 TeV. Signal is observed for the hrst time 
at a signihcance of more than three standard deviations in two intervals: 0.1 < < 

2.0 GeV^c^, close to the photon pole, and between the charmoninm resonances. No 
signihcant signal is observed in the 1.1 < g^ < 6.0 GeV^c^ range. The uncertainties of the 
measurements in the region 15 < g^ < 20 GeVyc'^ are reduced by a factor of approximately 
three relative to previous LHGb measurements [^. The improvements in the results, which 
supersede those of Ref. 19 , are due to the larger data sample size and a better control of 


systematic nncertainties. The measnrements are compatible with the predictions of the 
Standard Model in the high-g^ region and lie below the predictions in the low-g^ region. 

The hrst measnrement of angular observables for the A^ —)■ A ^^decay is reported, 
in the form of two forward-backward asymmetries, in the dimuon and pvr systems and the 
fraction of longitndinally polarised dimnons. The measnrements of the A^-q observable 
are in good agreement with the predictions of the SM, while for the Apg observable 
measnrements are consistently above the prediction. 
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A Appendix 

The measured values of the brauchiug fractiou of the —)■ A/jA decay uormalised 
to A^ —)■ J/A A decays are giveu iu Table where the statistical aud total systematic 
uucertaiuties are showu separately. 


Table 6: Differential branching fraction of the >• AijA^ decay relative to >• J/ijjA decays, 
where the uncertainties are statistical and systematic, respectively. 


interval [GeV^c'^ 


1 . 1 - 6.0 
15.0-20.0 




10-3[(GeV7c' 




0.1 

-2.0 

0.56 

+ 0.20 

- 0.17 

+ 0.03 

- 0.03 

2.0 

-4.0 

0.18 

+ 0.18 

- 0.15 

+ 0.01 

- 0.01 

4.0 

- 6.0 

0.04 

+ 0.14 

- 0.04 

+ 0.01 

- 0.01 

6.0 

- 8.0 

0.40 

+ 0.20 

- 0.17 

+ 0.01 

- 0.02 

11.0 

- 12.5 

1.19 

+ 0.24 

- 0.23 

+ 0.04 

- 0.07 

15.0 

- 16.0 

1.78 

+ 0.31 

- 0.28 

+ 0.08 

- 0.08 

16.0 

- 18.0 

1.94 

+ 0.23 

- 0.22 

+ 0.04 

- 0.09 

18.0 

- 20.0 

1.97 

+ 0.23 

- 0.22 

+ 0.10 

- 0.07 


0.14 

1.90 


+0.10 

- 0.09 

+ 0.14 

- 0.14 


+0.01 

- 0.01 

+ 0.04 

- 0.06 


20 






Figure 10: Two-dimensional 68% CL regions (black) as a function of Apg and /l. The shaded 
areas represent the regions in which the PDF is positive over the complete cos Oi range. The 
best fit points are indicated by the (blue) stars. 


The two-dimensional 68 % CL regions for the observables Tpg and /l are given in 
Fig[l^ for each interval in which signal is observed. 
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